RAW SEQUENCE LISTING
PATENT APPLICATION: US/09/866,925
DATE: 10/24/2001
TIME: 14:35:30

Input Set : A:\3124-Z.txt
Output Set: N:\CRF3\10242001\I866925.raw

<110> APPLICANT: Feldmann, Richard J.
<120> TITLE OF INVENTION: ALGORITHMIC DETERMINATION OF FLANKING DNA SEQUENCES THAT
      CONTROL THE EXPRESSION OF SETS OF GENES IN PROKARYOTIC,
      ARCHEA AND EUKARYOTIC GENOMES
<130> FILE REFERENCE: 3124-Z
<140> CURRENT APPLICATION NUMBER: US 09/866,925
C--> <141> CURRENT FILING DATE: 2001-05-30
<160> NUMBER OF SEQ ID NOS: 249
<170> SOFTWARE: Proprietary

<210> SEQ ID NO: 1
<211> LENGTH: 175
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (3939065)...(3939239)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3197
<400> SEQUENCE: 1
aaaaaatgcg cggtcagaaa attattttaa atttcctctt gtcaggccgg aataactccc 60
tataatgcgc caccactgac acggaacaac ggcaaacacg ccgccgggtc agcggggttc 120
tcctgagaac tccggcagag aaagcaaaaa taaatgcttg actctgtagc gggaa 175

<210> SEQ ID NO: 2
<211> LENGTH: 175
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (4032781)...(4032955)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3308
<400> SEQUENCE: 2
taaatttcct cttgtcaggc cggaataact ccctataatg cgccaccact gacacggaac 60
aacggcaaac acgccgccgg gtcagcgggg ttctcctgag aactccggca gagaaagcaa 120
aaataaatgc ttgactctgt agcgggaagg cgtattatgc acaccccgcg ccgct 175

<210> SEQ ID NO: 3
<211> LENGTH: 186
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (3939657)...(3941012)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3204
<400> SEQUENCE: 3
gatgtgccca gatgggatta gctagtaggt ggggtaacgg ctcacctagg cgacgatccc 60
tagctggtct gagaggatga ccagccacac tggaactgag acacggtcca gactcctacg 120
ggaggcagca gtggggaata ttgcacaatg ggcgcaagcc tgatgcagcc atgccgcgtg 180
tatgaa 186

<210> SEQ ID NO: 4
<211> LENGTH: 186
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (3941057)...(3941609)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3206
<400> SEQUENCE: 4
gtccccttcg tctagaggcc caggacaccg ccctttcacg gcggtaacag gggttcgaat 60
cccctagggg acgccacttg ctggtttgtg agtgaaagtc acctgcctta atatctcaaa 120
actcatcttc gggtgatgtt tgagatattt gctctttaaa aatctggatc aagctgaaaa 180
ttgaaa 186

<210> SEQ ID NO: 5
<211> LENGTH: 186
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (3943852)...(3944312)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3223
<400> SEQUENCE: 5
gctgaagtag gtcccaaggg tatggctgtt cgccatttaa agtggtacgc gagctgggtt 60
tagaacgtcg tgagacagtt cggtccctat ctgccgtggg cgctggagaa ctgagggggg 120
ctgctcctag tacgagagga ccggagtgga cgcatcactg gtgttcgggt tgtcatgcca 180
atggca 186

<210> SEQ ID NO: 6
<211> LENGTH: 144
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (3944314)...(3944450)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3225
<400> SEQUENCE: 6
aaacagaatt tgcctggcgg ccgtagcgcg gtggtcccac ctgaccccat gccgaactca 60
gaagtgaaac gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac 120
tgccaggcat caaattaagc agta 144

<210> SEQ ID NO: 7
<211> LENGTH: 112
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (3944469)...(3944573)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3228
<400> SEQUENCE: 7
ggtcataaaa ccggtggttg taaaagaatt cggtggagcg gtagttcagt cggttagaat 60
acctgcctgt cacgcagggg gtcgcgggtt cgagtcccgt ccgttccgcc ac 112

<210> SEQ ID NO: 8
<211> LENGTH: 57
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (4024935)...(4025088)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3301
<400> SEQUENCE: 8
ttatcgtgcc tacaaatagt ccgaaccgta ggccggataa ggcgtttacg ccgcatc 57

<210> SEQ ID NO: 9
<211> LENGTH: 56
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (4032754)...(4034681)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3307
<400> SEQUENCE: 9
tgccggatgc ggcgtaaacg ccttatccgg cctacggttc ggactatttg taggca 56

<210> SEQ ID NO: 10
<211> LENGTH: 347
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (4038097)...(4038215)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3327
<400> SEQUENCE: 10
aaaaaatgcg cggtcagaaa attattttaa atttcctctt gtcaggccgg aataactccc 60
tataatgcgc caccactgac acggaacaac ggcaaacacg ccgccgggtc agcggggttc 120
tcctgagaac tccggcagag aaagcaaaaa taaatgcttg actctgtagc gggaaggcgt 180
attatgcccg tcacaccatg ggagtgggtt gcaaaagaag taggtagctt aaccttcggg 240
agggcgctta ccactttgtg attcatgact ggggtgaagt cgtaacaagg taaccgtagg 300
ggaacctgcg gttggatcac ctccttacct taaagaagcg ttctttg 347

<210> SEQ ID NO: 11
<211> LENGTH: 347
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (4032754)...(4034681)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3307
<400> SEQUENCE: 11
aaaaaatgcg cggtcagaaa attattttaa atttcctctt gtcaggccgg aataactccc 60
tataatgcgc caccactgac acggaacaac ggcaaacacg ccgccgggtc agcggggttc 120
tcctgagaac tccggcagag aaagcaaaaa taaatgcttg actctgtagc gggaaggcgt 180
attatgcccg tcacaccatg ggagtgggtt gcaaaagaag taggtagctt aaccttcggg 240
agggcgctta ccactttgtg attcatgact ggggtgaagt cgtaacaagg taaccgtagg 300
ggaacctgcg gttggatcac ctccttacct taaagaagcg ttctttg 347

<210> SEQ ID NO: 12
<211> LENGTH: 335
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (4163878)...(4165793)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 3432
<400> SEQUENCE: 12
tgcgcggtca gaaaattatt ttaaatttcc tcttgtcagg ccggaataac tccctataat 60
gcgccaccac tgacacggaa caacggcaaa cacgccgccg ggtcagcggg gttctcctga 120
gaactccggc agagaaagca aaaataaatg cttgactctg tagcgggaag gcgtattatg 180
cacaccacac catgggagtg ggttgcaaaa gaagtaggta gcttaacctt cgggagggcg 240
cttaccactt tgtgattcat gactggggtg aagtcgtaac aaggtaaccg taggggaacc 300
tgcggttgga tcacctcctt accttaaaga agcgt 335

<210> SEQ ID NO: 13
<211> LENGTH: 72
<212> TYPE: DNA
<213> ORGANISM: E. Coli
<220> FEATURE:
<222> LOCATION: (2729433)...(2729505)
<223> OTHER INFORMATION: Chromosome = 1 Strand = negative ConnectronObjectNumber = 2218
<400> SEQUENCE: 13
cttgtcaggc cggaataact ccctataatg cgccaccact gacacggaac aacggcaaac 60
acgccgccgg gc 72

<210> SEQ ID NO: 14
<211> LENGTH: 43
<212> TYPE: DNA
<213> ORGANISM: H. Pylori
<220> FEATURE:
<222> LOCATION: (1062106)...(1062148)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 812
<400> SEQUENCE: 14
ttttactcat agggttttta tagttcctag cggaactaaa gca 43

<210> SEQ ID NO: 15
<211> LENGTH: 43
<212> TYPE: DNA
<213> ORGANISM: H. Pylori
<220> FEATURE:
<222> LOCATION: (1158533)...(1158575)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 882
<400> SEQUENCE: 15
tagcggaact aaagcattca tcccaaacac taaagatatt tgg 43

<210> SEQ ID NO: 16
<211> LENGTH: 70
<212> TYPE: DNA
<213> ORGANISM: H. Pylori
<220> FEATURE:
<222> LOCATION: (1062106)...(1062175)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 813
<400> SEQUENCE: 16
ttttactcat agggttttta tagttcctag cggaactaaa gcattcatcc caaacactaa 60
agatatttgg 70

<210> SEQ ID NO: 17
<211> LENGTH: 70
<212> TYPE: DNA
<213> ORGANISM: H. Pylori
<220> FEATURE:
<222> LOCATION: (1158506)...(1158575)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 881
<400> SEQUENCE: 17



ttttactcat agggttttta tagttcctag cggaactaaa gcattcatcc caaacactaa 60
agatatttgg 70

<210> SEQ ID NO: 18
<211> LENGTH: 70
<212> TYPE: DNA
<213> ORGANISM: H. Pylori
<220> FEATURE:
<222> LOCATION: (1062106)...(1062175)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 813
<400> SEQUENCE: 18
ttttactcat agggttttta tagttcctag cggaactaaa gcattcatcc caaacactaa 60
agatatttgg 70

<210> SEQ ID NO: 19
<211> LENGTH: 70
<212> TYPE: DNA
<213> ORGANISM: H. Pylori
<220> FEATURE:
<222> LOCATION: (1158506)...(1158575)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 881
<400> SEQUENCE: 19
ttttactcat agggttttta tagttcctag cggaactaaa gcattcatcc caaacactaa 60
agatatttgg 70

<210> SEQ ID NO: 20
<211> LENGTH: 56
<212> TYPE: DNA
<213> ORGANISM: H. Pylori
<220> FEATURE:
<222> LOCATION: (1614783)...(1614838)
<223> OTHER INFORMATION: Chromosome = 1 Strand = positive ConnectronObjectNumber = 1241
<400> SEQUENCE: 20
ttttactcat agggttttta tagttcctag cggaactaaa gcattcatcc caaaca 56

<210> SEQ ID NO: 21
<211> LENGTH: 37
<212> TYPE: DNA
<213> ORGANISM: S. Cervesiae
<220> FEATURE:
<222> LOCATION: (802985)...(803022)
<223> OTHER INFORMATION: Chromosome = 4 Strand = positive ConnectronObjectNumber = 1352
<400> SEQUENCE: 21
ttatgagaag ctgtcatcga agttagagga agctgaa 37

<210> SEQ ID NO: 22
<211> LENGTH: 362
<212> TYPE: DNA
<213> ORGANISM: S. Cervesiae
<220> FEATURE:
<222> LOCATION: (876188)...(876255)
<223> OTHER INFORMATION: Chromosome = 4 Strand = negative ConnectronObjectNumber = 1416
<400> SEQUENCE: 22
L ttagatcta



60 

ttacattatg ggtggtatgt tggaataaaa a 



lt caactatc atctactaac 
VERIFICATION SUMMARY
PATENT APPLICATION: US/09/866,925
DATE: 10/24/2001
TIME: 14:35:31

Input Set : A:\3124-Z.txt
Output Set: N:\CRF3\10242001\I866925.raw

11 M:271 C: Current Filing Date differs, Replaced Current Filing Date